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Abstract 

The present paper reviews issues involved in converting continuous variables to nominal 
variables to be used in OVA techniques. The literature dealing with the dangers of 
dichotomizing continuous variables is reviewed. First, the assumptions invoked by OVA 
analyses are reviewed in addition to concerns regarding the loss of variance and a 
reduction in score reliability that result from the conversion of continuous variables to 
nominal variables. Second, regression is discussed as a more adequate alternative to 
ANOVA. Finally, a heuristic data set is presented and used as a demonstration of three 
alternatives that can occur through this conversion and when ANOVA is employed. 
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Mutilating Data and Discarding Variance: The Dangers of Dichotomizing Continuous 

Variables 

OVA techniques were historically the dominant method used in the social 
sciences (Edgington, 1964, 1974). Thompson (1985, 1986, 1994, 1999) has argued, 
however, that regression and other GLM techniques have recently become more 
prevalent while the proportion of OVA methods has declined (cf. Elmore & Woehike, 
1988; Goodwin & Goodwin, 1985; Stone-Romero, Weaver, and Glenar, 1995). One 
reason for this shift is offered by Thompson (1999): 

Improved analytic choices have partially been a function of growing researcher 
awareness that: The researcher’s fundamental task in deriving defensible results is 
to employ an analytic model that matches the researcher’s (too often implicit) 
model of reality, (p. 24 ) 

Thompson (1986) offered three potential weaknesses oftentimes involved in 
ANOVA. First, the requirement that continuous data be converted to nominal data may 
result in information being “squandered.” Most variables have scales that are higher than 
nominal, a notable exception being gender. The reality of these variables (as continuous) 
is lost as they are modified (as nominal) for analytic purposes. The result is an analytic 
model that no longer matches the model of reality. 

Second, the relationships and distribution shapes of some variables may become 
distorted through OVA methods (Thompson, 1986). The “balanced design,” preferred by 
researchers employing OVA methods, is not typically indicative of reality. While having 
equal number of participants in each cell is advantageous for OVA analysis because 
effects are then uncorrelated, a high price is paid, as explained by Thompson (1986), “as 
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subjects rarely come or stay packaged in equal or proportional numbers per cell, 
researchers may induce further mortality in order to achieve balanced design” (p. 919). 
Furthermore, predictors are assumed to be normally distributed and perfectly 
uncorrelated in OVA analyses. In reality, however, the predictors may be normally 
distributed and may be somewhat correlated with each other (Thompson, 1986). 

Third, the reliability of the predictors (that were originally continuous) decreases 
as they are converted to a nominal scale. Reliability is basically a function of variance. It 
follows that discarding information and its subsequent loss of variance (through 
conversion from continuous to nominal data) will result in a decrease in reliability. The 
result is a reduction in power against Type II errors (Cohen, 1968, Thompson, 1986, 
Thompson, 1999). A summation of problems inherent in ANOVA is offered by 
Humphreys (1978): 

To summarize the defects in the analysis of variance design, primarily there is a 
substantial loss in the power of the statistical test of the research hypothesis. 

There is often, as well, an illusion of control of variables and a tendency to 
interpret significant relationships causally. Also, when orthogonality is imposed 
The purpose of this paper is to review the effects of converting continuous 
variables into ones that are dichotomized (or trichotomized). First, the nature of 
dichotomized variables is discussed. Next, the inherent problems that result from 
converting continuous variables to dichotomous ones are reviewed. Next, regression is 
discussed as an alternative to ANOVA. Finally, heuristic examples are offered that 
demonstrate these effects and how they alter the true nature of the original data. 




5 



Dichotomizing Continuous Variables 5 

Dichotomous Variables 

A dichotomous variable is “a discrete variable with only two classifications” 
(Hinkle, Wiersma, & Jurs, 1998, p. 617). It is discrete in that the number of possible 
classifications can be counted (in this case- two). A dichotomous variable is one example 
of a categorical variable. A categorical variable is “one in which subjects differ in type 
or kind” (Kerlinger & Pedhazur, 1973, p. 102). Examples of categorical variables 
include race, biological sex, and marital status. Biological sex is also an example of a 
dichotomous variable in that there are exactly two classifications. 

A continuous variable, on the other hand, is one “that can take on any value in the 
measurement scale being used” (Hinkle, Wiersma, & Jurs, 1998, p. 617). Subjects differ 
on continuous variables in amount or degree (Kerlinger & Pedhazur, 1973). Examples of 
continuous variables include intelligence, dosages of a drug, and frequency of 
reinforcement. 

To dichotomize (or trichotomize) a continuous variable, which is required for any 
OVA analysis, one must divide the scale or results of the continuous variable into a fixed 
number of classifications (two for a dichotomy and three for a trichotomy). An example 
would be to take scores from an intelligence test and divide them into two or three 
groups. In this manner, the original scores take on new meaning as part of a new group 
with “similar” scores. Each group represents a level in a particular “way” for the 
purposes of ANOVA. In this case, a number of levels (determined by the number of 
groups created by the researcher) of the way- intelligence- are created. 
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A major problem inherent in converting continuous data to nominal (categorical) 
data is the loss of information (Kerlinger, 1986; Pedhazur, 1982). As Thompson (1981) 
emphasized: 

When we reduce interval level of scale data to the nominal level of scale we are 
doing nothing less than thoughtlessly throwing away information which we 
previously went to some trouble to collect. If research is conducted for the 
purpose of acquiring knowledge, then is it consistent with our purpose to employ 
a method which “throws away” information which might provide a more refined 
understanding of the phenomena which we are studying? (p. 8) 

This “throwing away of information” occurs in the case of dichotomized or 
trichotomized variables that were originally continuous because of two basic assumptions 
of dichotomized data. As explained by Thompson (1999), these assumptions are: 

1- All participants in a given level of a way are considered to be the same. 

2- All participants in a given level of a specific way are considered to be 
different from those in different levels of that way. 

For example, if a researcher has a somewhat normal distribution of cholesterol 
scores and trichotomizes them at 125 and 175, he/she is saying that a person with a score 
of 130 is the same as a person with a score of 165. In addition, a person with a score of 
173 is different than a person with a score of 177. Essentially, scores on a continuous 
variable (cholesterol scores) have been reduced to three levels of a trichotomized way 
(cholesterol way). A given person’s cholesterol score, therefore, can belong to only one 
level and is equally different from all of the scores in the other two levels. This analytic 
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model created by converting the continuous data to nominal data is seemingly not 
indicative of reality. 

Another major problem of dichotomizing variables is a reduction of reliability. 
Furthermore, as described by (Thompson, 1999): 

In such cases, it is the reliability of the dichotomy that we actually analyze, and 
not the reliability of the highly-reliable, intervally-scaied data that we originaiiy 
collected, which impact the analysis we are conducting, (p. 27, emphasis in 
original) 

In other words, the reduction of a reliable variable to a dichotomous variable 
actually makes that variable less reliable (Cliff, 1987; Humphreys, 1978). This reduction 
results in less power against Type II error (Cohen, 1968; Thompson, 1986). The 
reduction in reliability occurs as emphasis shifts from the reliable data itself to how 
reliably the data was dichotomized or trichotomized. In the above example the researcher 
would no longer be analyzing the reliable scores of 130, 165, etc. but would instead be 
analyzing the new scores created by the trichotomized groups. In this example, the three 
groups (levels) would consist of those scores less than 125, those between 125 and 175, 
and those greater than 175. 

As a heuristic example, consider a study by Song and Parry (1999) regarding 
product innovativeness and its relationship with various organizational synergies and 
product performance. In this study, Song and Parry collected data on 788 products from 
404 firms. These firms responded to 7 measures of product innovativeness on scales of 0 
to 10 (10 indicative of “very innovative”). The results are reproduced in Table 1. 
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As seen in Table 1, the majority of products (561) were rated between 2.5 and 7.5 
on the averaged scale of innovativeness. Only 32 were rated between 0 and 2.5 while 
195 were rated between 7.5 and 10. At this point the authors decided to dichotimize the 
results into “low-innovatiness” and “high-innovativeness” making the split at the score 
mean of 5.99. By making this split, the authors essentially forced the assumption that all 
scores between 0 and 5.99 are the same and that all scores between 6.0 and 10.0 are also 
the same. This seems hardly the case given the initial breakdown of results in Table 1. 

This dichotimization has also forced the assumption that all scores in the “low- 
innovativeness” category are equally different from those in the “high-innovativeness” 
category. In other words, the difference between a score of 5 and 6 is the same as the 
difference between 2 and 9. Again, this does not seem to match the reality of the data. 

Parry and Song (1999) continued their analysis by running an ANOVA with the 
two levels of innovativeness against six organizational and performance variables. A 
seemingly better alternative would have been to run a regression for each of the six 
dependent variables with innovativeness retained as a continuous independent variable. 
The results would then have been based on the reliability of the original data and not on 
the reliability of the dichotomy, which in this case, was seemingly less. 

Regression can serve as a replacement for ANOVA because, as noted by Cohen 
(1968), ANOVA and ANCOVA are special cases of multiple regression. Furthermore, as 
pointed out emphatically by Cohen and Cohen (1983), “it (multiple regression/correlation 
analysis) is a versatile, all-purpose system for analyzing the data of the behavioral, social, 
and biological sciences and technologies” (p. 4, emphasis added). 
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Kerlinger and Pedhazur (1973) added: 

...(multiple regression) can be used equally well in experimental or non- 
experimental research. It can handle continuous and categorical variables. It can 
handle two, three, four, or more independent variables... Finally... multiple 
regression analysis can do anything the analysis of variance does- sum of squares, 
mean squares, F ratios- and more. (p. 3, emphasis added) 

Three Heuristic Scenarios 

Whether dichotomizing or trichotomizing intervally-scaled date is appropriate 
depends on the situation. Specifically, it is dependent on whether the converted data 
meets the two assumptions of the new model of reality: First, is each member of a 
particular group essentially the same, and second, are all members of one group 
essentially different from the members of the other groups? 

There are three possible situations that can influence the decision to convert 
intervally-scaled data to nominally-scaled data. These three alternatives are 
demonstrated using the heuristic data in Table 2. 

Alternative one- Impact is minimal. In this scenario, the data in Table 2 for XI 
are re-expressed as a trichotomy under XI . This does not appear problematic in this 
case as the grouping of scores maintains the general variability of the original data set. In 
other words, the assumptions invoked by the trichotomizing of the data are consistent 
with original data: The scores within each group are basically the same, and the scores 
within each group differ equally from the scores in the other groups. The analytic model 
set up by the researcher matches the model of reality in the original scores. Graphically, 
this can be seen in Figures 1 and 2. In this case, the variance in the original data is 
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maintained in the trichotomy, and the results, therefore, will be representative of the 
original data. 

Alternative two- Variance is created where there is none. This situation occurs 
when intervally-scaled data are converted without clear differences between the original 
scores. The data in X2 found in Table 2, for example, are clearly lacking variance 
between the scores. The conversion of this data however, as shown in X2’, would 
represent an analytical model purporting a difference between groups that does not match 
the model of reality in which the difference is meaningless. 

As evidence of the creation of imaginary variance, Thompson (1994) refers to a 
dissertation in which the author assigned children to one of three depression groups based 
on their scores on a depression variable. The top score of all of the children was 3.43. 

As noted by the author of the dissertation, the recommended lower-end cutoff for 
severely depressed people is 4. In other words, the author created three groups of 
depressed children when none of them met the minimum score for being clinically 
depressed ! 

Alternative three- Variance is discarded and shape is distorted. As discussed 
previously, dichotimizing or trichotimizing continuous data invokes its assumptions of 
equality within groups and difference between groups regardless of the shape of the 
original data. X3 in Table 2, for example, is somewhat normally distributed. This is 
illustrated graphically in Figure 3. The conversion of the data in X3’ substantially alters 
this distribution as illustrated in Figure 4. The results of a regression on X3 and an 
ANOVA on X3’ are presented in Figure 5. As seen in the results of the regression, the 
results are statistically significant (pcalculated = .0001) with an R 2 of .555. The results 




11 



Dichotomizing Continuous Variables 11 



from the ANOVA on the modified data (X3’), however, are not statistically significant 
(Pcalculated = .1 15) and the effect size, measured by eta 2 , dropped to .214. Clearly, this 
is evidence that “statistical tests based upon continuous distributions are more powerful 



than those based upon dichotomized (or trichotimized) distributions” (Humphreys & 
Fleishman, 1974, p. 468, italics added). In this scenario, the re-expression of the original 



data has discarded a substantial amount of variance, which can mean 



“nonsignificant 



results when in fact the tested results may be significant” (Kerlinger & Pedhazur, 1973, p. 



8 ). 

A strategy sometimes resorted to after trichotomizing data is to drop the middle 
group and run analyses only with the extreme “high” and “low” groups (Humphreys, 
1978). If this is done on the data in X3’ by eliminating the subjects with scores of 2, we 
obtain the results illustrated in Figure 6. Statistical significance is reached (Pcalculated 
= .05) and eta 2 becomes .283. While these results appear to be “better” than the original 
ANOVA results, they are still lower than those produced by regression on the original X3 
data. These results support the explanation of Humphreys (1978) regarding the 
elimination of middle groups, “While power can be increased after categorizing by 
discarding a category or categories from the center of the distribution, the increase does 

not compensate for the initial loss” (p. 873). 

Humphreys (1978) goes on to explain another critical effect of eliminating middle 

groups, “as a side effect, the inflation of the differences between means when extreme 
groups are used frequently inflates the experimenter’s evaluation of the importance of 
those differences” (p. 873). In other words, the experimenter is only looking at the 
extreme values from the trichotomized data. And, this occurs after converting the 
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original continuous data to nominal, trichotomized data! Without considering the scores 
in the middle, there may be a perceived importance of the variable as there is a large 
difference between the extremely low and extremely high scores. Again, in this case a 
regression on the original data would be more appropriate as it would prevent the loss of 
information due to eliminating or “mutilating” the original data. 

Conclusion 

The loss of variance that occurs when continuous or interval data is converted to 
nominal data is a major concern when considering the use of ANOVA as an analytic tool. 
This paper has reviewed specific concerns that should be considered prior to using OVA 
techniques on continuous predictor variables. The loss of information, reduction in 
reliability, and assumptions invoked by ANOVA are reasons why correlational 
techniques such as regression more fully preserve the nature of the original data. As 
Pedhazur (1982) emphasized: 

Categorization of attribute variables is all too frequently resorted to in the social 
sciences. . It is possible that some of the conflicting evidence in the research 
literature of a given area may be attributed to the practice of categorization of 
continuous variables... Categorization leads to a loss of information, and 
consequently to a less sensitive analysis, (pp. 452-453) 
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Table 1 

Breakdown of project scores 

Number of Projects 
0-2.S 2.5-7.S 7,5-10 

7-item scale 32 561 195 

Note. From “Challenges of Managing the Development of Breakthrough Products in 
Japan,” by X. Michael Song and Mark E. Parry, 1999, Journal of Operations 
Management, 17, p. 675. 



Table 2 

Heuristic Data Set 



Predictors 



Id 


Y 


XI 


XV 


X2 


X2’ 


X3 


X3’ 


1 


295 


499 


1 


599 


1 


584 


1 


2 


297 


499 


1 


599 


1 


584 


1 


3 


299 


500 


1 


599 


1 


589 


1 


4 


299 


500 


1 


599 


1 


589 


1 


5 


299 


500 


1 


599 


1 


589 


1 


6 


301 


501 


1 


599 


1 


596 


1 


7 


303 


501 


1 


599 


1 


596 


1 


8 


296 


599 


2 


600 


2 


597 


2 


9 


298 


599 


2 


600 


2 


598 


2 


10 


300 


600 


2 


600 


2 


600 


2 


11 


300 


600 


2 


600 


2 


600 


2 


12 


300 


600 


2 


600 


2 


600 


2 


13 


302 


601 


2 


600 


2 


602 


2 


14 


304 


601 


2 


600 


2 


603 


2 


15 


298 


699 


3 


601 


3 


604 


3 


16 


300 


699 


3 


601 


3 


604 


3 


17 


302 


700 


3 


601 


3 


611 


3 


18 


302 


700 


3 


601 


3 


611 


3 


19 


302 


700 


3 


601 


3 


611 


3 


20 


304 


701 


3 


601 


3 


616 


3 


21 


306 


701 


3 


601 


3 


616 


3 



Note. XV, XI ' , and X3’ are the re-expressions in nominal score form of their intervally 



scaled variable counterparts. 
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Figure Captions 

Figurel. Distribution of scores for variable XL 
Figure 2. Distribution of scores for variable XL. 

Figure 3. Distribution of scores for X3. 

Figure 4. Distribution of scores for X3’. 

Figure 5. Regression (Y, X3) and ANOVA (Y, X3 ) of Table 2 data. 
Figure 6. ANOVA (Y, X3’) with middle group deleted. 
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8 




500.0 525.0 550.0 575.0 600.0 625.0 650.0 675.0 700.0 



Std. Dev = 83.67 
Mean = 600.0 
N = 21.00 



XI 
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8 



6 



4 



2 



0 



2.00 



1.50 



1.00 



X3P 



I Std. Dev = .84 
Mean = 2.00 
N = 21.00 

2.50 3.00 
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Regression (Y, X3) 



Model Summary 



Model 


R 


R Square 


Adjusted 
R Square 


Std. Error of 
the Estimate 


1 


. 745 a 


.555 


.532 


1.8900 



a. Predictors: (Constant), X3 



ANOVA 3 



Model 


Sum of 
Squares 


df 


Mean Square 


F . 


Sig. 


1 Regression 


84.796 


1 


84.796 


23.738 


,000 a 


Residual 


67.870 


19 


3.572 






Total 


152.667 


20 









a. Predictors: (Constant), X3 

b. Dependent Variable: Y 



ANOVA (Y, X3’) 

ANOVA 



Y 





Sum of 
Squares 


df 


Mean Square 


F 


Sig. 


Between Groups 


32.667 


2 


16.333 


2.450 


.115 


Within Groups 


120.000 


18 


6.667 






Total 


152.667 


20 









Note. Using an Excel function (i.e., “=FDIST (f, dfl, df2)” - “-FDIST (23.738, 1, 19) , 
the exact Pcalculated value was found to be .000106. For the ANOVA, eta was 
computed to be 21.39% (32.6667 / 152.6667). 
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ANOVA 



Y 





Sum of 
Squares 


df 


Mean Square 


F 


Siq. 


Between Groups 


31.500 


1 


31.500 


4.725 


.050 


Within Groups 


80.000 


12 


6.667 






Total 


111.500 


13 









Note. Forthe ANOVA, eta 2 was computed to be 28.25% (31.50 / 111.50). 
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